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Abstract 

A chord diagram refers to a set of chords with distinct endpoints on a circle. The intersection graph 
of a chord diagram C is defined by substituting the chords of C with vertices and by adding edges between 
two vertices whenever the corresponding two chords cross each other. Let C n and G n denote the chord 
diagram chosen uniformly at random from all chord diagrams with n chords and the corresponding 
intersection graph, respectively. We analyze C n and G n as n tends to infinity. In particular, we study 
the degree of a random vertex in G n , the k -core of G ni and the number of strong components of the 
directed graph obtained from G n by orienting edges by flipping a fair coin for each edge. We also give 
two equivalent evolutions of a random chord diagram and show that, with probability approaching 1, 
a chord diagram produced after m steps of these evolutions becomes monolithic as m tends to infinity 
and stays monolithic afterward forever. 
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1 Introduction 

A chord diagram of size n is a pairing of 2 n given points on a circle. We label the points 1 through 2 n 
clockwise and join each pair of points in the pairing by a chord to obtain a geometric and a combinatorial 
object. It is easy to see that there are (2n)!/(2 n n!) chord diagrams with n chords. 

Although we are mainly concerned about the combinatorics of random chord diagrams in this paper, 
chord diagrams appear extensively in some other fields such as in the study of some invariants in knot 
theory [3, 6, 8, 19, 25], in the representation theory of Lie algebras [11], and in codifying the pairings 
among nucleotides in RNA molecules [10, 21, 22]. For detailed information about chord diagrams and their 
topological and algebraic significance we refer the reader to Chmutov, Duzhin, and Mostovoy’s book [13]. 

Many probabilistic and enumerational problems about chord diagrams have been studied. A remarkable 
formula for the generating function counting chord diagrams with a given genus was given by Harer and 
Zagier [17]. Later, Linial and Nowik [20] and Chmutov and Pittel [14] studied the genus of a uniformly 
chosen random chord diagram. In random graph theory, Bollobas et al. [9] used linearized chord diagrams 
to generate a preferential attachment random graph introduced by Barabasi and Albert [7]. 

The enumeration of chord diagrams was first studied by Touchard in a sequence of papers. In [26], 
he found a functional equation for the bivariate generating function m T njiri x m y n , where T n ^ m denotes 
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the number of chord diagrams with n chords and m crossings. Using this, J. Riordan [23] found an exact 
formula for T n ^ m in the form of an alternating sum. Note that T„ q counts the famous Catalan numbers. 
Using some recurrence relations, Stein and Everett [24] showed that a random chord diagram of size n 
is connected with probability approaching 1 as n —>• oo. Cori and Marcus [15] counted non-isomorphic 
chord diagrams. Acan and Pittel [2] studied a phase transition for the appearance of a giant component 
and found asymptotic estimates for T n m for m = O(nlogn). Various enumerational problems were also 
studied in [1], 

Flajolet and Noy [16] studied a uniformly random chord diagram with n chords and, using generating 
functions, they showed that (i) the number of components (defined formally below) approaches l + Pois(l) 
as n -> oo, where Pois(A) denotes the Poisson distribution with parameter A, and (ii) the number of 
crossings is asymptotically Gaussian. They also showed that almost all chord diagrams are monolithic, 
where a monolithic diagram consists of a root component and a number of isolated chords. 

In this paper, we study several characteristics of a random chord diagram. In particular we extend the 
results of Flajolet and Noy about the components of a random chord diagram in several directions. Before 
we proceed, we note that the results in this paper are presented in graph theory language. The following 
definition allows us to do so. 

Definition 1.1 (Intersection graphs). For each chord diagram C, we define the corresponding intersection 
graph Gc as follows. Each chord in C becomes a vertex in Gc, and two vertices in Gc are adjacent if and 
only if the corresponding chords cross each other in C; see Figure 1. 

The intersection graphs of chord diagrams are related to circle graphs, where a circle graph is the 
intersection graph of a set of labeled chords. (The endpoints of chords are not labeled in this case.) 
Unlabeled versions of these two classes of graphs are the same. Some of the NP-complete problems 
in general graphs, such as finding the clique number or independence number, have polynomial time 
algorithms for circle graphs (and hence for the intersection graphs of chord diagrams). 

By Definition 1.1, any graph theoretic term about a chord diagram can be understood in reference to 
the corresponding graph. For example, components of a chord diagram C corresponds to the components of 
Gc and the k -core of Gc corresponds to the k -core of Gc- Throughout the paper, we denote by C n a chord 
diagram chosen uniformly at random from all chord diagrams of size n and (with an abuse of notation) by 
G n the corresponding intersection graph. 

In section 2 we find the asymptotic distribution of the degree of a random chord in C n , that is, the 
distribution of the number of chords crossing a random chord. In Section 3 we define monolithic chord 
diagrams and we give an alternative proof for the fact that almost all graphs are monolithic, a fact first 
proven by Flajolet and Noy. In Section 4 we study the k -core of C n (equivalently the k -core of G n ) for 
k = o(\/n). In Section 5 we study oriented chord diagrams or equivalently directed intersection graphs. 
In particular, we show that the number of strong components in the random directed intersection graph 
converges in distribution to 1 + Pois(3), a result analogous to the result of Flajolet and Noy for the 
undirected case. In Section 6 we present two models of dynamically growing random chord diagrams. We 
show that these two models are the same in some sense; after n chords are drawn, both of them give 
C n after some canonical relabeling of the endpoints. We show that, during these evolutions, as n —>• oo 
however slowly, a random chord diagram becomes monolithic and then stays monolithic afterward with 
probability approaching 1. 

We conclude this work with a discussion on the independence number of C n . Note that a chord diagram 
can be viewed as a fixed-point-free involution of [2 n\. Using this interpretation and a result of Baik and 
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Figure 1: A chord diagram and the corresponding intersection graph 

Rains [5, Theorem 3.1] about the longest increasing subsequence of a random involution, Chen et al. [12, 
Remark 5.6] determined the asymptotic distribution of the clique number of C n . Their terminology differs 
from ours though. They define r-crossings and r-nestings as follows: a set of r chords (aq, y±), ..., (x r ,y r ) 
is an r-crossing if x\ < ■ ■ ■ < x r < y\ < ■ ■ ■ < y r and it is an r- nesting if aq < • • • < x r < y r < ■ ■ ■ < y\. 
According to this definition, Chen et al. showed that crossing numbers and nesting numbers are distributed 
symmetrically. Recently, Baik and Jenkins [4] proved that cr n and ne n are asymptotically independent, 
where cr n denotes the maximum crossing in C n and ne n denotes the maximum nesting in C n . 

Notation and Terminology 

Here we give the notation and terminology used frequently in the paper. Additional notation and definitions 
will be given as they become necessary in the later section. For two integers a and b, we denote by [a, b\ 
the set {a, a + 1,..., b}. We simply write [b] for [1, b\. For a positive integer k , we denote by (2k — 1)!! the 
product of odd integers from 1 to 2k — 1. 

A chord diagram is a pairing of a set of points, which are referred to as endpoints in this work. Our 
endpoints are labeled with positive integers. Unless otherwise stated, a chord diagram of size n has the 
set of endpoints [2n]. A block refers to a set of consecutive endpoints on the circle. The length of a block 
is the number of endpoints it contains. A subdiagram of a chord diagram C is a subset S of the chords of 
C, where the labeling of endpoints is inherited from C. For example, {(1,4), (3,6)} is a subdiagram of the 
chord diagram given in Figure 1. 

A chord joining the endpoints x and y is denoted by (x,y) or (y,x). A chord (x,y) divides the circle 
into two arcs and hence defines two blocks of endpoints, B\ and B 2 , where B\ and B 2 are disjoint and 
their union gives the whole set of endpoints except x and y. In Figure 1, the two blocks determined by 
the chord (3,6) are {4, 5} and {7, 8,9,10,1, 2}. If the set of endpoints is [2n] and x <y, then the lengths 
of the blocks determined by (x, y) is y — x — 1 and 2n — y + x — 1. 

For a chord c = (x, y) in a given chord diagram C , the length of c, denoted (-(c), is the minimum of the 
lengths of the two blocks determined by c. For example, in Figure 1, the length of (5,9) is 3. According 
to this definition, a chord joining two consecutive endpoints has length 0. The length of a chord can be at 
most n — 1, in which case the chord is (x,x + n) for some x € [n]. 
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2 Degree of a random chord in C n 


Let C be a chord diagram and c = (x, y) be a chord in C. The degree of c in C, denoted degc(c ), is the 
number of chords crossing c in C. In other words, the degree of c in C is the degree of the corresponding 
vertex in the graph Gc- 

Studying the degree of a random chord in C n is equivalent to studying the degree of the chord containing 
the endpoint 1. In this section this chord is denoted by c\. The length of c\ in C n is distributed as 


P(£( Cl ) = k) 


2^1 if 0 < A: < n — 2, 
2S=I Xk = n- 1 . 


The degree of c\ is closely related to l{c\) as we shall see. One obvious observation is that deg{c\) < 
Conditioned on the event (7(ci) = k}, we define X k as the number of chords with both endpoints lying in 
the smaller block determined by c\. If c\ = (l,n + 1), then both blocks have lengths n — 1, in which case 
it does not matter which block we choose. Note that 0 < X k < [k/ 2j. 

Example. For C = {(1, 8), (2,4), (3,11), (5, 7), (6, 9), (10,12)}, we have £(ci) = 4 and X 4 = 1. 

Now, we compute the mean fi k := E[X/. 0 ] and the variance a 2 := E[A'|] — y 2 . To find these, we can 
assume without loss of generality that c\ = (1, k + 2). For the mean, we write 

k'k 'y ] ) 

O ,y) 


where the sum ranges over all x and y such that 2 < x < y < k, and l{ x ,y) is th e indicator random variable 
for the existence of the chord ( x,y) conditioned on c\. Thus, 


E[l(x,y)] = E((x,y) is a chord in C n \ c\ 


(1, k + 2)) 


(2n- 5)!! 
(2n — 3)!! 


1 

2n — 3’ 


and consequently 


fk\ 1 _ k(k- 1) k 2 

\2 ) 2n — 3 _ 2(2n - 3) ~ An 


( 2 . 1 ) 


as k and n tend to infinity. Thus, as n tends to infinity and for k = o( v / n), with high probability 1 (whp), 
the degree of ci is k, conditioned on l(c\) = k. To compute the variance, first we write 


E[X k (X k - 1)] 


E E P<: 


x,y)^-(u,v] 


H 


k — 2\ (2n 
2 ) (2n 


7)!! 

3)!!' 


Using the last two equations we get 

= E[X k (X k - 1)] + E[X k ] = + 

4(2n — 3j(2n - 

x We say that an event E that is defined for all n € (1, 2,... } occurs “ with high probability'" , abbreviated as whp, if the 
probability of E approaches 1 as n tends to 00 . Thus, even if it is not explicitly stated, we always have the assumption 
“n —> 00 ” when we use the term “whp”. 


E[X 2 ] 
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and 


2 _ k{k - 1) 

a k — 


k 2 — 5k + An — 4 k(k — 1) 


2(2 n - 3) V 2 (2n - 5) 
k(k — 1) 


2(2n - 3) 


[(2 n - kf - lOn + 5A: + 6] < 


k 2 (2n — kf 


2(2n- 3) 2 (2n - 5) L J 

Theorem 2.1. Lei D(n ) = deg{cf)/n and let b be a constant such that 0 < b < 1/2. We have, 


( 2 . 2 ) 


lim P (D(n) < b) = 1 — \/l — 26. 

n—> oo 


/n other words, Din ) converges in distribution to a random variable with density function (1 — 2x) 1 / 2 in 
the interval [0,0.5]. 

Proof. We want to show that lin^^oo F(deg(c\) < bn) = 1 — y/l — 2b. Note that, conditioned on i{cf) = k, 
the degree of ci is equal to k—2X k . The mean and the variance of X k together imply that X k is concentrated 
around its expected value n k . More specifically, by Chebyshev’s inequality and equations (2.1) and (2.2), 


P 


\X k -n k \ >^=)< 

n j uj 


a 2 k n (2 n - kf 

2 k 2 


n 2 w 2 


4 

- "uj 2 


(2.3) 


for any uj approaching infinity. In the rest of the proof, we let uj = log n although any u j approaching oo 
sufficiently slowly would work. Now fix a constant b E (0,1/2). We have 


71—1 

F{deg{c i) < bn) = J^P(fc - 2X k < bn \ t(c{) = k) ■ P(£(ci) = k) 
k=0 

2 n ~ 2 

= t, -- 2Xfc < bn \ £(ci) = k) 

In — 1 

k =o 

H-• P(n — 1 — 2X n _i < bn I £(c\) = n — 1). 

2n — 1 

Clearly, the last term in this equation is of order 0(l/n). Noting that the random variable X k is defined 
conditional on the event {£{cf) = k}, we write P ((k — bn)/2 < X k ) instead of P (k — 2X k < bn \ £{cf) = k) 
for simplicity. Thus, 

2 / i. _ \ 

F(deg(a) < bn) = ■ ^ P ( —-— < X k J + 0(l/n) 

71 k =0 ^ ' 

= /'£ p (^ sx ‘) +0(1/n) ' 
k =0 V ' 

Next, we split this sum into two as 


71—2 

E p 

k =0 


k — bn 


71—2 


<x k = ^2 p ( - < x k 


k =0 


k — bn 


71—2 

+ D 

k =0 


k — bn 


| X k /ifc | E 


k log n 


n 


X k fj. k | E 


klogn 


n 


<X k 


I X k fa k | > 


k log n 


n 


| X k fj k \ > 


k log n 


n 
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Since P(|X*, — n k \ > klogn/y/n) = 0((logn) 2 ) by (2.3), the last sum above is 0(n(logn) 2 ). On the 
other hand, the first sum on the right side of the equation is equal to 


n —2 


(1 - 0(l/(logn) 2 )) 


k =0 


k — bn 


<X k 


\X k fj , k | ^ 


k log n 


n 


(2.4) 


Note that 


P 


k — bn 


<X k 


| X ,- W |<^|=0 


n 


unless k belongs to A, where 

f k — bn k log n 
A := { k : —--— < fJ, k 


2 y/n 

Thus, the sum in Equation (2.4) is bounded above by the size of A, i.e., 


(2.5) 


n —2 

E 1 

k =0 


k — bn 


<X k 


\X k -» k \<^)<\A k \. 


( 2 . 6 ) 


Now let t := k/n, where t € [0,1]. Using (2.1), the inequality k 2 fcrt — fc ^| n < fi k can be written as 


t — b flogn t(tn — 1) 

2 y/n 2(2 n — 3) — 

Multiplying by 4 and rearranging, we get 


-tr + 2t-2b- 


4 1 log n 4 1 — 6 1 


+ 


n 


4n — 6 


< 0 . 


(2.7) 


The roots of the equation —t 2 + 2f — 2b = 0 are 1 — \/l — 2b and 1 + y/1 + 2b. Moreover, the left side of 
this equation is negative for t € [0,1 — \/l — 2b) and positive for (1 — \/l — 2b , 1]. Thus, by (2.7) 

\A\ < (1 - Vl - 2b + e n ) n 

for any e n —> 0 + as long as log n/(e n y/n) —> 0. Consequently, 

P(de< 7 (ci) < bn) < (l — O (l/(logn) 2 )) • ^1 - Vl — 2 ~bj + O (l/(logn) 2 ) 

= 1 - y/1 - 26 + O (l/(log n) 2 ) . (2.8) 


To find a lower bound on 

n —2 


E p 

k =0 


k — bn 


<X k 


\X k [A k \ E 


k log n 
\fn 


we note that 


P 


k k — bn 


<X k 


x k — /Xfcl < 


k log n \ 


= 1 
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if k lies in the set B, where 


B : = 



k — bn + k log n 


n 



Thus, 


(2.9) 


E r(^.<x k 


k =0 


1-U - „ t \ < > Ifll 

v n 


( 2 . 10 ) 


Now we need to estimate the size of B. As in the case of A, we let 
definition of B in terms of t as 


-t 2 + 2 1 


, 4 1 log n At — 6t 2 

2b + —^ + ~A — 

y/n 4n — o 


< 0. 


k/n and write the inequality in the 


( 2 . 11 ) 


For 0 < t < 1 — y/l — 2b — e n , the inequality in (2.11) is satished as long as e n goes to 0 sufficiently slowly. 
In particular, (2.11) holds for e n = l/(logn) 2 and large enough n. Consequently, analogous to (2.8), we 
have 


P (deg(c) < bn) > 1 — y/l — 2b — O (l/(logn) 2 ) . (2.12) 

Finally, combining (2.8) and (2.12), we get 
lim P (deg(c) < bn) = 1 — y/l — 2b, 

n^t- oo 

which finishes the proof. □ 

3 Monolithic chord diagrams 

Monolithic diagrams were introduced by Flajolet and Noy [16]. Using generating functions, they showed 
that almost all intersection graphs of chord diagrams have one large component and some isolated vertices. 
Here we give an alternative proof to this result and in the subsequent sections we give several extensions. 

Definition 3.1. A chord that connects two consecutive endpoints is called a simple chord. Thus, a simple 
chord is of the form {i,i + 1) for some i € [2n] (addition is always in modulo 2 n). The component containing 
endpoint 1 is called the root component. A chord diagram C is monolithic if 

(i) C consists only of the root component and simple chords, and 

(ii) there is no pair of simple chords next to each other in C, that is, there is no i € [2n] such that both 
(i, i + 1) and (i + 2, i + 3) are chords in C. 

The original definition given by Flajolet and Noy does not have part (ii) but we include that for 
convenience in the next sections. As we will see it is not much of a restriction. 

Theorem 3.2 (Flajolet and Noy). Whp, C n is monolithic. Moreover, the number of simple chords in C n 
approaches in distribution a Poisson random variable with parameter 1. 
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Proof. If a chord diagram C is not monolithic, then for some 2 < k < n/2, there is a set of k chords whose 
endpoints form a block of length 2k. Let B^ denote the number of such sets of k chords in C n . We want to 
show that ^2 Bk —>• 0, where the sum is over all integer k such that 2 < k < n/2. By Markov’s inequality, 
it is enough to show —>• 0. Since there are 2 n blocks of 2k endpoints, we have 

E[B t ] = 2 n ( 2 *-iy ! < ilB -»- 1 > !! , 

1 J (2n — 1)!! 

and consequently, 

Ln/2j 

^2 = 2n 
k =2 

Note that, since 

E [B k ] _2n-2k-l 

+ 1 

E[Sfc] is decreasing with k for < |_n/2j — 1. Hence, 

K 2 J „ 

bn 


L ^ J (2fc- l)!!(2n — 2k — 1)!! 

k=2 


(2 n - 1 )!! 


E E ™= E ™ + <"/ 2 > E ^] s ( 2„-i)(2n-3) + (n/2)(2n) ^|f=Tyr = 0 


k =2 


n 


which shows that C n is monolithic whp. Finally, the next lemma shows that the number of simple chords 
converges in distribution to Pois(l) as n —>• oo. □ 

Let Lj the number of length j chords in C n . A chord c has length j if c = (i,i+j + 1) for some i G [2n], 
where the addition is in modulo 2 n. Note that Lq counts the simple chords in C n . 

Lemma 3.3. As n tends to infinity, the random variable Lj converges in distribution to a Poisson random 
variable with mean 1, for any 0 < j < n — 2. 


Proof. We compute the factorial moments E r , where E r := E 
event {(i,i + j + 1) G C n }. We have 

E r = ^2 F(^j t = 1 for all t G [r]). 

l<2i<Z2*”<v^2n 

For a tuple (*i,..., i r ) in the sum, we have 


("/) 


Let fi denote the indicator of the 



(2n—1)!! 


if if, = i a + j + 1 for some a, b G [r], 
otherwise. 


Hence, 


_E r < 


2?r\ {2n-2r- 1)!! 
J (2n - 1)!! 


(3.1) 


(3.2) 


r j [Zn — lj!' r! 

If the probability in (3.1) is 0, we call the tuple a bad tuple. We obtain an upper bound on the number 
of bad tuples by choosing an endpoint i, then choosing the endpoint i + j + 1, and then choosing r — 2 
endpoints from the remaining 2n — 2 endpoints. This can be done in 2n( 2 f f (J 2 2 ) ways. Thus, 


E r > 


2 n 
r 


— 2 n 


2n — 2 
r-2 


[2n-2r- 1)!! 
(2 n- 1)!! 


r\ 


(3.3) 


By (3.2)-(3.3), the binomial moments of Lj converge to those of a Poisson random variable with mean 1 
and hence Lj converges in distribution to a Poisson random variable with mean 1. □ 















4 The fc-core of C n 


The k- core of a chord diagram C is the largest subdiagram S of C with the property that each chord in S 
crosses at least k other chords in S. This means that the minimum degree in the intersection graph of S 
is at least k. In this section we study the size of the k-cove of C n . 

For a chord diagram C, we denote by C>k(C ) the subdiagram of C consisting of the chords with lengths 
of at least k. For the simplicity of notation, we write £>*, instead of £,>k(C n ). We say that two chords of 
a chord diagram are neighbors if they cross each other. 

Theorem 4.1. Let k be a function of n such that k 2 = o(n) as n —>• oo. Then, as n — >• oo, the minimum 
degree in the subdiagram C>k is at least k whp. Consequently, as n —» oo, the k-core of C n is the 
subdiagram C>k whp. 

Proof. Since for any chord c we have deg(c) < £(c), the k- core of C n is a subset of C>k- Hence, for k as in 
the theorem, we only need to show that each chord in C>k has at least k neighbors from C>k- 

Let us call a chord (x,y) of C>k a bad chord if (x,y) has fewer than k neighbors in £>&. It is enough 
to show that the expected value for the number of bad chords tends to 0. For any t > k, let r = r(t) be 
the first integer larger than (t — k)/ 2 , i.e., 

r = r(t) := [{t — k + 2)/2j. (4.1) 


Thus, r{t + 1) = r(t) if (t — k ) is even and r(t + 1) = r(t) + 1 otherwise. 

Any chord with one endpoint in [i + 1, i + t] and one endpoint outside [i — k, i + t + 1 + k] has length 
at least k. Hence, if the chord c = (*, i +1 + 1) is bad, then, other than c, there are at least r chords with 
both endpoints in [i — k, i + t + 1 + k]. So the number of bad chords of length t is bounded above by the 
number of pairs (c, {ci,..., c r }), where c = (i,i + t + 1) € C n for some i, {ci,..., c r } is a set of chords such 
that all the endpoints of Cj’s lie in [* — k, i +1 + 1 + k\, and c 0 {ci,..., c r }. For t > k, let E t be the sum of 
expected values of such pairs as i varies from 1 to 2n. Since there are 2 k + t endpoints in [i — k, i +1 +1 + k] 
other than i and i +1 + 1, we have 


/ 2k + t\ (2 r - l)!!(2n -2 r- 3)!! 
V 2 r ) (2n- 1)!! 


(4.2) 


We will show that Y^t=k —■t 0 as n —> oo. First, by (4.1) we have t = 2r + k — 2 or t = 2r + k — 1, 
depending on the parity of t — k. In particular, we have t < 2r + k for any t. Using this upper bound for t 
in (4.2), we write 


/3k + 2r\ (2r - l)!!(2n - 2r - 3)!! 
V 2 r ) (2n- 1)!! 


Since r is increasing with t, the maximum value of r occurs at t = n — 1 and r(n — 1) = [(n + 1 — k)/2\. 
Let N = [(n + 1 — fc)/2j. Since each r (except possibly for r = N) has two preimages, we write 


71 — 1 


N 


Y J Et<2{2n) ^ 


/3k + 2r\ (2r — l)!!(2n — 2r — 3)!! , 

V 2r ) (2n- 1)!! “ '" 


N 


E F r 


where 

_ /3k + 2r\{2r-l)\\(2n-2r-3)\\ 
r V 2r ) (2 n - 1)!! 
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An easy calculation shows that F\ < 2 k 2 /n 2 . We will show that this is the main contribution to the sum. 
In order to see how the terms F r are changing, we take the ratio of two consecutive terms and obtain 


F r _ (2r + 2)(2n-2r-3) 

F r+ 1 (3k + 2 r + 2) (31c + 2r + 1) 


From this fraction we see that the terms F r are initially decreasing rapidly, and as r gets closer to the 
upper bound of the sum they almost stabilize. To be more precise, we divide the set [1,1V] into three 
disjoint pieces I\,I 2 , and I 3 such that 


I 1 =[1,N 1 ], I 2 =[N 1 + 1 ,N 2 ], I 3 =[N 2 + 1,N], 

where N\ = [u/10j and N 2 = |_4n/9_|. Using (4.3), it is easy to verify that 


7 r > < 


8 , for r€/i, 
1 , for r € I 2 , 
1 / 2 , for r € I 3 . 


In the first interval the terms are decreasing faster than a geometric sequence with ratio 1/8, so we have 
F Nl < ( 1 / 8 ) JVl " 1 Fi and 

F r < 2F\. 

In the second interval the terms are decreasing, so we have Fjy 2 < F^ and 
Y F r < n ■ F Nl < ?r(l/ 8) iVl " 1 • F x . 

rg/2 

In the last interval we just bound the sum above by a geometric series with a constant ratio 2. As a result, 
we have 


Y Fr = 0(F N2 ■ 2 N ~ N2 ) = 0(F 1 ■ (l/ 8 ) Nl ■ 2 N ~ N2 ) 

Combining all three sums, we get 

n 

YF r = 0(F 1 ) = 0(k 2 /n 2 ). 

r =1 

Consequently, 

n— 1 

YFt = 0 (k 2 /n), 

t=k 

which goes to 0 by the assumption of the theorem. □ 

Recall from Section 3 that Lj denotes the number of chords of length j. The next result is an extension 
of Lemma 3.3. 
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Theorem 4.2. Let k be any nonnegative fixed integer. We have 
(L 0 , Lx,... ,L k ) (T 0 ,Ti,... ,T k ), 

where T t ’s are independent copies of the Poisson random variable with parameter 1. 

Proof. We will show that the factorial moments converge to those of independent Pois(l) random variables. 
Let r = (ro,..., r k ) be a vector of nonnegative integers, and let s = Yli=o r *- 


E r := E 


Lq 

ro 


Lx 
r l 


L k 

r k 


To choose a chord of length j, it is enough to choose the initial endpoint; if i is the initial endpoint, then 
the chord of length j will be (i,i + j + 1 ), where we take the value of i + j + 1 in modulo 2 n in case 
i + j + 1 > 2 n. Hence, 


E r < 


2 n 
s 


3 \ (2n - 2s - 1)!! 

P’o, • • •, r k ) (2n — 1 )!! 

On the right side of this inequality, the first factor is an over-count for the number of ways to choose the 
initial endpoints of the chords. Once we choose the initial endpoints of the chords, we can partition them 
into k + 1 sets, where the endpoints in the jth set will correspond to the initial endpoints of length-j 
chords. Next, there are (2n — 2s — 1)!! ways to pair the remaining endpoints. The denominator (2n — 1)!! 
is the number of all chord diagrams. After cancellations, we can write the right side of Equation (4.4) as 


/ k \ /s-l 

nil- IT 


\i =0 


0=0 


2 n-j 
2 n - 2j - 1 


= 1 + 0 


k 

ni 


(4.5) 


i=0 


For a lower bound on E r , consider the points satisfying the inequalities 

1 < i\ < — (k + 1) < • • • < i s — (s — 1 )(k + 1) < 2n — s(k + 1). (4-6) 

Note that any s-tuple (ii, ... ,i s ) satisfying (4.6) can be the initial endpoints of the s chords, regardless of 
how they are partitioned into k + 1 sets of cardinalities ro,..., r k . Introducing j t = i t — (t — l)(k + 1), we 
have 

1 < ji < h < ■ ■' < Js < 2 n - s(k + 1 ), 

and hence the number of ways to choose the initial points meeting the condition in (4.6) is ^ 2n- ^ fc + 1 ))_ 
Thus, 


E r > 


2 n — s(k + 1 ) 


s \ ( 2 n — 2 s — 1 )!! 
r 0 ,...,r k ) ( 2 tt, — 1 )!! 


Similar to (4.5), for the right side of the above inequality, we have 


ii 'Mu 


^*=0 


u=° 


2 n — s(k + 1 ) — j 
2 n - 2j - 1 


= 11 + 01 — 

n 


k 

IIV 


(4.7) 


i =0 


Combining the upper and lower bounds for E r , we get 
k 


E r 




i=0 


from which the desired result follows. 


□ 
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The next theorem is an extension of Theorem 3.2, which gives the 1-core of C n . 

Corollary 4.3. For any fixed positive integer k, the size of the k-core of C n converges in distribution to 
n — Pk, where Pj~ is a Poisson random variable with mean k. 


Proof. By Theorem 4.1, the subdiagram £>& is the k-core of C n whp. By Theorem 4.2, the number 
of chords with lengths smaller than k, i.e., the sum 'ffi J > jf2 )Lj, converges in distribution to the sum of 
k independent Pois(l) random variables. The sum of independent Pois(Aj) random variables is another 
Poisson random variable with mean ^ ■ A j, which finishes the proof. □ 

Next we find an upper bound for the number of ‘small-length chords’. For a positive integer k, let 

z k = Ej=o L r 

Lemma 4.4. If k —> oo and k < n — 1, then Z^/k —>• 1 in probability. 


Proof. We find the first and the second moments of Z^ and apply Chebyshev’s inequality. For j < n — 1, 
there are exactly 2 n chords of length j, which are (i,i + j + 1) for 1 < i < 2 n. Each of these chords exists 
in C n with probability l/(2n — 1). Then, E [Lj\ = 2n/{2n — 1), and consequently 


= ^ T =* +S ^. 

3=0 

Now, E[Zf] = E [Z k (Z k - 1)] + E [Zk] and 

^[ Z k( Z k — 1)] = ^2 E[l( ai6 )l( C)d )], 

( a,b),(c,d) 


(4.8) 


where the sum is over all pairs of distinct chords of lengths at most k — 1. A chord of length at most k — 1 
can be chosen in 2 nk ways. Once such a chord, say (a, b), is chosen, there are 2 nk — (4 k — 1) ways to choose 
another chord (c, d) with length at most k — 1 since {a, 6} n {c, d} = 0. After these two chords are chosen, 
the rest of the endpoints can be paired in (2n — 5)!!, so 

2nk(2nk — Ak + l)(2n — 5)!! 2nk(2nk - Ak + 1) 

nz k (z k - 1)] =-pswryi!-= (2 n - 1)(2» - 3) ' (4 ' 9) 

Using (4.8) and (4.9), after straightforward computations we get 
a 2 :=E [Zl}-p 2 = k + o[^j. 

Let e be a positive constant. Using |/i — k\ < 1 and Chebyshev’s inequality, 

P(|Z„ - *| > ek) < P(| Z t - „| > ek - 1) < = 0 (I) • 

which finishes the proof. □ 


Lemma 4.4 combined with Theorem 4.1 gives us the approximate size of the fc-core in C n for k growing 
to infinity slowly enough. 

Corollary 4.5. Let k —» oo slowly enough so that k 2 /n —>• 0 as n tends to infinity. Let Rk denote the size 
of the k-core in C n . Then, (n — Rk)/k —>• 1 in probability as n —>• oo. □ 
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5 Directed intersection graphs 


In this section we consider chord diagrams whose crossings are oriented. For any chord diagram C, we think 
of the chords as pieces of strings. If two chords c and d of C cross each other, then either c over-crosses 
d, or d over-crosses c. If c over-crosses d, then we also say that d under-crosses c, and vice versa. With 
this interpretation, each crossing can be oriented in one of two ways. Given a chord diagram C with m 
crossings, there are 2 m orientations of C. 

The directed intersection graph Gx> is obtained from an oriented chord diagram T> as follows: if a chord 
c € V over-crosses a chord d € V, then the edge joining c and d in Gx> is directed toward the vertex d. We 
say that an oriented chord diagram V is strongly connected if Gx> is strongly connected as a graph. 

We denote by D n an oriented chord diagram obtained from C n by choosing one of the possible orien¬ 
tations uniformly at random. More precisely, we choose a chord diagram uniformly at random from all 
chord diagrams, and then we orient each of its crossings in one of the two possible ways by flipping a fair 
coin. If T> is an oriented chord diagram, then 

P(T> n = V) = ^ 2 n _ l y A 2 cr(V) ’ 

where cr{T>) denotes the number of crossings in V. 

The Flajolet-Noy Theorem (Theorem 3.2) tells us that almost all (unoriented) chord diagrams are 
monolithic. Consequently, as n —» oo, whp, the intersection graph of C n has a giant component of size 
n — X n , where X n is the number of isolated vertices. The same theorem also gives that X n converges 
in distribution to Pois(l). Here we will prove an analogous theorem for the random directed intersection 
graph. We first introduce some terminology that will be helpful later. 

Let V be an oriented chord diagram, c be a chord belonging to V, and S' be a subset of the chords of V, 
where S does not contain c. We say that c over-crosses (under-crosses) S if c over-crosses (under-crosses) 
any chord d of S that it crosses. When c over-crosses S in T>, in the intersection graph Gx> any edge cd 
with d € S is directed toward d. Similarly, when c under-crosses S in D, in the intersection graph Gx> any 
edge cd with d £ S is directed toward c. 

In this section we denote by C>k the set of chords with length at least k in D n . In several steps we will 
show that, whp, C>k is strongly connected as a subdiagram of D n as k tends to infinity. We start with 
finding a large clique in C n , which will act like a progenitor of the ‘giant’ strong component in Go n - 

Definition 5.1. A balanced clique of size m is a clique such that there is exactly one endpoint in each of 
the blocks [ 1 , r], [r + 1 , 2 r], • • • , [( 2 m — l)r + 1 , 2 mr], where r = [n/m \. 

Let B m denote the number of balanced cliques of size m in C n . We next show that B m is positive whp 
for some large m. 

Lemma 5.2. Let e be a positive constant. If m < [(2 + e)^ 1 (logn) _ 1 // 2 n 1 ,/ 2 J, then there exists a balanced 
clique of size m in C n whp, i.e., P(S m > 1) —> 1 as n —>• oo. 

Proof. Let r = |_ n/m\ , and let I t denote the block [(t — l)r + l,tr\. We have n/m > (2 + e)y/nlogn, so 
r > (2 + e)y / nlogn — 1. For 1 < t < m, let Aft) be the event that there is no chord (x, y) in C n such that 
x € It and y € It+m- The event {B m = 0} lies in the union U 7 fL 1 A{t). Let p t := P(A(t)). By the union 
bound, 

m 

P (Bm = 0) < y ^Pt, 

t =1 
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and by symmetry all the pt s are the same. Thus, P (B m = 0) < mpt . It is enough to show pt = o(l/m). 
For 0 < j < r/2, we denote by B(t,j) the event that there is no chord from I t to / i+m , and there are 
exactly j chords with both endpoints in I t . Note that L>(t, j)’s are disjoint and A(t) = B(t. j). Thus, 


Pt 


[r/2\ 

3=0 



( 2 n- 2 r + 2 j-l)\\ 
3h ( 2 n — 1 )!! 


Let fj be the term in the above sum corresponding to index j. For j < [r/ 2j we have 

fj = ( 2 j + 2 ) ( 2 n -3r + 2 j + 2 ) _ ( _ r 

fj+i 0 - 2 j)(r - 2 j - 1 ) V 2 n — 2 r + 2 j + 1 

It is easy to see that this ratio is increasing with j. Thus, the sequence is unimodal; there is an 

index k such that {fj} is increasing for j < k and it is decreasing afterward. To find this k, we need to 
solve the equation fj/fj+ 1 = 1. It is easy to see that the index k of the maximum term goes to infinity 
and k = o(r). For 1 < j <r (here a<C 6 means a/b —> 0), using (5.1) we write 

fj ^ 4nj ^ 4 j 

fj+i r 2 (2 + e) 2 logn" 

From this we conclude that the maximum term ff occurs when k is asymptotic to (logn)/2. Let us find 
an upper bound for /&. We have 



fk = 


2 k 


( 2 k — 1 )!! 


T ! 


i 


2 n — 2 ?'V ., ( 2 n — 2 r + 2 k — 1 )!! 

r- 2 *J (r “ 2 t)l —(SrrTyn— 
( 2 n — 2r)H (2n - 2r + 2 k - 1 )!! 


(r — 2 k\) 2 k kl (2n-3r + 2 k)U 


( 2 n- 1 )!! 


2 k k\ 


/2k-l 


( r—2k—l 


n(r -0 ■ n ( 2 n — 2 r — i) 


i =0 


i =0 


/ r—k—1 

n 

V i =0 


1 


2 n — 1 — 2 i 


(5.2) 


Now we find an upper bound for each of the four terms in the product above. Using Stirling’s formula, we 
obtain 


1 


< — 


2 k k\ ~ \ 2 k; 

For the second term in (5.2), we have 


2k—1 

n ( r - *) - r2k - 

i =0 
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For the third term in (5.2), we have 


r—2k —1 


r—2k —1 


)[ (2n - 2r - i) = (2n) r ~ 2k JJ 1 


i =0 


i=0 


2 r + i 
2 n 


= {2n) r ~ 2k exp 


)— 2k—1 


J2 log i 


= (2n ) r ~ 2k exp ( + O ( — + 


L i=0 
5 r 2 


2 r + i 
2 n 


(rk r 3 


4n 




n n. 


Finally, for the last term in (5.2), we have 

r—k—1 r—k—1 


n 

i=0 


1 p")— n 

i=0 v 


2 n — 1 — 2 i 


-1-2 z 


<(2„r-’n 1 (i + |l 


*=o 


2 n / 


= ( 2 „r+‘exp[r_ + 0(^ + 3 




r/c ?’ 3 


Combining (5.2) and the four upper bounds for the terms on its right side, we get 


ft < I ^iV’' 2 / 2 " 




4nk ) V V n n 2 ) / V 4 nk 

er 2 \ 


Now letting ^(.t) = xlog (|^rj and taking the derivative, we get 


\ er i 

9 W = 4^ “ L 


Thus, g(x) takes its maximum when x = ?’ 2 /(4n), and consequently, the maximum of occurs for 

x = r 2 /(4n). So, 

2 \ k 

\ g— r 2 /In ^ gr 2 /4)2g—r 2 /2n _ g— r 2 /in 


, 4nfc / 

Since = Ej=o J /j < I 1 + ( r / 2 )] fk, we have 
Pt = O log ne _r ’" /, l 4ri l^ 
and consequently, 


P(£> m = 0) < m ■ pt = O (ne r2 /l 4n )^ = O ^n 1 4 } ^ , 


which finishes the proof. 


□ 
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Let T n denote the tournament chosen uniformly at random from the set of all tournaments on n vertices. 
In other words, T n is the directed graph obtained from the complete (undirected) graph K n by orienting 
the edges independently of each other with probability 1/2 in each direction. It is well known that T n is 
strongly connected whp. In fact, a result of Janson [18, Theorem 1] states that there are many directed 
Hamilton cycles in T n whp. In the next lemma we use this fact and Lemma 5.2 to show that £> m is 
strongly connected for sufficiently large m whp. Subsequently, in several steps, we will show that the same 
event holds whp for any m —> oo. Let A(k, n ) denote the event that there is a strongly connected balanced 
clique of size k in D n . 


Lemma 5.3 (Step 1). Whp, the subdiagram £>^3/5] is strongly connected as n —>• oo. 

Proof. Let m = [n 3 ^ 5 J and ml = \n 9 / 20 J. By Lemma 5.2 and the fact that a random tournament is 
strongly connected whp, the event A(m',n) holds whp. Let c be a chord in D n whose length is at least m. 
Since £(c) > m, each of the two arcs determined by c contains at least (m — 2r)/r of the blocks I \,..., I- 2 m', 
where r = \n/m'\ and I t = [(t — 1 )r + 1 ,tr\. As a result, c intersects at least, say, m/r — 2 chords of any 
balanced clique of size m!. If B is a stongly connected balanced clique of size m' in D n , then c belongs 
to the same strong component containing B unless either c over-crosses B or it under-crosses B. Thus, 
the probability of c not belonging to the strong component of B is at most 2(l/2) m//r_2 . Consequently, 
the probability that there is a chord of length at least m that does not belong to the strong component 
containing B is 0(n ■ 2 -m / r V which goes to zero since m/r ~ n 1 / 20 . In other words, conditioned on the 
event A(m',n ), which occurs whp, all the chords of length at least m lie in the same strong component 
whp. This finishes the proof. □ 

Lemma 5.3 tells us that the subdiagram consisting of chords of length at least n 3 /' 5 is strongly connected 
whp. By Lemma 4.4, whp, the number of chords with length at most n? +£ is smaller than (1 + e)n^ +£ for 
any fixed e > 0. These two results together imply that, whp, there is a strong component, which deserves 
to be called the giant component , of size n — o{n). Next, using the previous result, we will show that chords 
of length at least n 1 / 3 belong to this giant component. We start with the following lemma. 

Lemma 5.4. As n ^ oo, whp, there is no block of length 3|_b 3//5 J that contains at least [n 1 / 3 ] chords. 

Proof. Let t := 3|_n 3 / 5 J and s : = [n 1 / 3 ]. Let Ij denote the set {j, j + 1,..., j + t — 1} of size t, and p(Ij) 
denote the probability that Ij contains at least s chords. By symmetry, p(Ji) = p(Ij) for any i and j and 
hence the probability that there is a block of length t with at least s chords is bounded above by 2 np{I\ ) 
by the union bound. We have 


2 np{I \) < 2 n 


t 

2 s 

./.2s 


< 2 n 


< 


(2s)! 

(—V 

\2ns J 


( 2 s - 1 )!!( 2 n - 2 s - 1 )!! 
( 2 n — 1 )!! 

1 ^—Lf — 2 n — 
n s s! 2 s n s 

/ \ n 1 / 3 

< (l8n" 2/15 ) -> 0, 


where the third inequality follows from Stirling’s formula, and the fourth one is obtained by plugging in 
the values of s and t. This concludes the proof. □ 


Lemma 5.5 (Step 2 ). As n —>• oo, whp, the subdiagram £ > 3 i n i/ 3 j is strongly connected. 
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Proof. Let k = 3 1_77 1//3 J and m = |_n 3//5 J- Let E be the intersection of the events that the subdiagram 
C> m is strongly connected and there is no interval of length 3m. that contains at least k/ 3 chords. By 
Lemma 5.3 and Lemma 5.4, the event E holds whp. 

Conditioned on E, there is no chord c in D n with k < £{c) < m such that c has fewer than k/3 neighbors 
in C> m - On the other hand, if c is a chord with at least k/3 neighbors from £> m , then the probability 
that c over-crosses (or under-crosses) C> m is at most 2 _fc / 3 . Therefore, conditioned on E, the probability 
that there is a chord c with k < £(c) < m that does not belong to the giant component is O (n ■ 2 _fc / 3 ), 
which goes to 0. Consequently, since E holds whp, C>k is strongly connected whp. □ 

Lemma 5.6 (Step 3). As n —>• oo, the subdiagram C> 2 \ 0 gn is strongly connected whp. 

Proof. Let k = Sl/i 1 / 3 ). By Lemma 5.5, the subdiagram £>*. is strongly connected whp. Thus, it is enough 
to show that, whp, there is no chord c with 21 ogn < £{c) < k over-crossing (or under-crossing) C>k- 

First we show that there is no block in [2n] of length 3 k that contains at least 4 chords. Indeed, the 
expected value of the tuples (/, c 1 , 02 , 03 , 04 ), where I is a block of length 3 k that contains the chords 
ci,c 2 ,C 3 , and C 4 is 

^ (2 n - l)(2n - 3)(2n - 5)(2n - 7) ” ° (" )' { °'° ] 

Let E' be the event that every chord c with 21ogn < £(c) < k has at least £{c) — 4 neighbors from C>k- 
By (5.3), the event E' holds whp. On the other hand, conditioned on E', the probability that there is a 
chord c with 21 ogn < £(c) < k that over-crosses (or under-crosses) £>& is 0 (n 2 _21ogn ) = 0 (l/n), which 
finishes the proof. □ 


Lemma 5.7 (Step 4). Whp, there is no block of length |_51ognJ that contains two chords. Consequently, 
whp, there is only one strong component of size larger than 1 . 

Proof. Let k = |_51ognJ and X n be the number of blocks of length k containing two chords. Given a block 
/ of length k, the probability that I contains two chords is at most 

S3!! (k*\ 

(2n-l)(2n-3) V» 2 / 

Since there are 2 n blocks of length k, the probability that there is one containing two chords is 0(k 4 /n) by 
the union bound, which goes to 0. In particular, whp, no two chords of length at most 21ogn cross each 
other. Combining this with Lemma 5.6, whp, there is only one strong component of size larger than 1. □ 


The last step is to find the number of strong components of size 1. Let F be the event that all the 
chords of length at least 2 log n belong to the same strong component and there is no block of size 5 log n 
containing two chords of length smaller than 21ogn. By lemmas 5.6 and 5.7, the event F occurs whp. 

Now, let c be a chord of length k < 2 log n. Conditioned on the event F, the chord c has k neighbors, all 
from £> 2 iogn- hr that case, c does not belong to the giant component if and only if either it over-crosses all 
its neighbors or it under-crosses all its neighbors. The probability that it over-crosses all k of its neighbors 
is 2 -fc , and likewise the probability that it under-crosses all k of its neighbors is 2~ k . Let (j(i) be the the 
indicator random variable which takes the value 1 if (i,i + j + 1) is a chord in D ni and this chord either 
over-crosses all of its neighbors or it under-crosses all of its neighbors. Conditioned on the event F, the 
number of single-chord strong components is n GW- 
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Lemma 5.8. For any positive integer k < 2 log n, we have 


2n I / 2,2 \ 

E e («’ : )] = 5EU + °(t)- 


2—1 


n J 


Proof. Let pi be the probability that the chord (i, i + k + 1} exists, it has exactly k neighbors, and it either 
over-crosses or under-crosses all of its neighbors, and let p 2 be the probability that (i , i + k + 1} exists and 
it has fewer than k neighbors. Then, we have p\ < E[£fc(?’)] = P(0t(i) = 1) < pi + P 2 - If (*, i + k + 1) exists 
and has fewer than k neighbors, then there must be a chord with both endpoints in [i + l,i + k\. Hence 
the probability p 2 is bounded above by (*)( 2n — 5)!!/(2n — 1)!!, so p 2 = 0{k 2 /n 2 ). On the other hand, 


1 


Pi = 


2n- 1 


(1 - 0{k 2 /n 2 )) • ^ rT , 


where the first factor accounts for the existence of the chord c = (i, i + k + 1), the second one accounts for 
c having exactly k neighbors, and the last one accounts for c under-crossing or over-crossing its neighbors. 
Combining the above, we get 

E [C*(*)] = ^ + 0(k 2 /n 2 ). 

Summing over all i, we obtain 


2 n 

2—1 


1 

2 k ~ 1 


+ o 



□ 


Corollary 5.9. For any oj = uj(n) that tends to infinity, whp, there is no chord c of length at least oj such 
that c is a single-chord strong component. 


Proof. By the previous lemma, we have 


21ogn 2 n 2 log n 

E E e k>«]= E 

j=uj i =1 j=u 




- + 0(j 2 /n ) 


0(2 u + log 3 n/n). 


Hence, Yli=i Cjif) = 0 whp. Coupling this result with the fact that F occurs with high probability, 

we get the desired result. □ 


Theorem 5.10. The number of single-chord strong components of D n converges in distribution to a Pois¬ 
son random variable with mean 3. Whp, other than the single-chord strong components, there is only one 
strong component, which is the giant component. 

Proof. The proof is an amalgamation of the previous results. We will call a strong component of size 1 
a trivial component, a chord of length smaller than 21ogn a small chord, and a chord of length at least 
2 log n a big chord. Let ui be an integer valued function of n that tends to infinity as n tends to infinity. By 
Lemma 5.6, Lemma 5.7, and Corollary 5.9, whp, there is a giant strong component that contains all the 
chords of length at least ui and all other components are trivial components. Lemma 5.7 says that whp 
there is no small chord c crossing fewer than £(c) big chords, where £(c) denotes the length of c. Let H be 
the intersection of all these high probability events. Conditioned on H , the trivial components correspond 
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to small length chords that have as many neighbors as their lengths and they either over-cross all their 
neighbors or they under-cross all their neighbors. We define binomially distributed random variables 


Vi ~ Bin 


where Li denotes the number of length i chords in the diagram, p$ = 1 , and pi = 2 1 1 for i > 1 . Conditioned 
on H , the number of trivial components is given by 770 + • • • + Vu-i ■ Now fix a positive integer k and define 


<t>n,k{t) ■= E 


exp 


lt Y V] 

3 =0 



‘ / * \ 


E 

exp it Vj 

To, • • •, Tfc 


[ V i=o J 



Note that 770 ,,Vk are independent conditioned on Lq, ..., Lk- Thus, 

k 

[J E[e^ | Lj] 

3 =0 

k 

JJ E [e u Bin ( L J ’Pi > | L j ] 

3 =0 

k 

Y[(e lt Pj + 1 -Pj) Lj - 
3 =0 


Using Theorem 4 . 2 , 



k 


k 

11 

e 

Y[(e lt Pj + 1 - Pj ) L i 

—>■ E 

U(*p j + i-Pj) T > 


3 =0 


3=0 


where TVs are independent copies of a Poisson random variable with mean 1 . On the other hand, 


E 


Yl(e lt Pj + 1 - Pj? 
3 =0 


k k 

n E K eit Pj + 1 - pj ) Tj ]=n ex p ( e%t Pj - pj) 

3= 0 3= 0 


= exp 


V-i)E n 

j =0 


The calculations above show that (j) n ,k{t) ex P [ 3 (e rf — 1 )] as k —>• 00, where exp [ 3 (e®* — 1 )] is the 
characteristic function of a Poisson random variable with mean 3 . Thus, 

Vo 4 -f Vk Pois( 3 ) 

as k —>• 00. We conclude the proof by taking k to infinity first and then letting uj = k. □ 


6 Growth of a random chord diagram 

In the previous sections we considered the static random chord diagram C n . In this section we discuss a 
couple of dynamically growing random chord diagram models. In both models, we start with no chord and 
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add chords one at a time randomly. In n steps, we obtain a chord diagram with n chords. We show that 
both models produce a uniformly random chord diagram after n steps. Once this is established, C n can 
be viewed as the nth step snapshots of two different random processes. 

With the associated intersection graphs, the random chord diagram processes can also be viewed as 
graph processes. Random graph processes are studied widely in the fields of random graphs to understand 
the dynamics of random graphs. The best known example is the graph process G n , where one starts with n 
labeled vertices and no edges, and at each step adds an edge choosing the edge uniformly at random from 
all non-edges. The Erdos-Renyi graph G(n,m) can be viewed as a snapshot at time m of the process G n . 

After we define both models, we show that the two models are actually not much different. We use 
these growth models to give another extension of the result by Flajolet and Noy (Theorem 3.2). 

6.1 Continuous model 

Consider the following growth process. Initially we have a circle with no chords and no endpoints on it. 
At step 1, we add a chord where we do not distinguish any two chords at this step. Thus, after step 1, we 
have a chord diagram with one chord. At step 2 , there are three ways to draw the second chord; either 
the second chord does not intersect the first one, which can happen in two ways, or it intersects the first 
one. More specifically, the first chord separates the circle into two arcs, and either both endpoints of the 
second chord lie on the first arc, or they both lie on the second arc, or they lie on different arcs. One of 
these possibilities is chosen uniformly randomly. In general, after step k, there are k chords and 2k disjoint 
arcs. For the next chord, there are ( 2 9 fc ) +2 k = ( 2fc 2 ^ 1 ) ways of choosing the arcs on which the endpoints 
lie, and one of them is chosen uniformly at random. For convenience, which will be clear later, one of the 
endpoints of the first chord is labeled with 1 after it is created. Basically, it will serve as a reference point. 

Remarks. 

(1) In this process, at each step, what matters is only the selection of the arcs for the endpoints of the 
chord to be added. We do not care about where exactly on the arc an endpoint is situated. 

(2) We do not label the endpoints during the process except one of the endpoints of the first chord. 
However, after each step we know the relative ordering of the endpoints. 

Let Uk denote the chord diagram obtained after the fcth step of the above process. At this time, we 
can label the endpoints of the chords in Uk starting with the already labeled endpoint 1 and labeling the 
jth endpoint we encounter with j as we traverse the circle clockwise. The following lemma asserts that C n 
is the snapshot of the above process after step n. 

Lemma 6.1. Uk is uniform among all the chord diagrams with k chords on the set of endpoints [2k]. 

Proof. Let C be a chord diagram with k chords on [2k]. We need to show that P (Uk = C) = 1/(2 k — 1)!!. 
Let ci,..., Cfc be a labeling of the chords of C such that the chord containing endpoint 1 is labeled as ci. 
In the evolution of Uk, the chords C 2 ,..., Ck can appear in any order. Let X 2 ■ ■ ■ Xk be a permutation of 
the chords C 2 ,... ,Ck, which represents a particular order of appearance in the evolution. Conditioned on 
{cci,..., Xi- 1 }, the probability that Xi occurs at step i is 1/( 2 *./ 1 ) • Thus, the probability that Xi occurs at 
step i for all i € [ 2 , k] is 

2 i — l\ _1 2 k ~ x 
. 2 ) ~ ( 2 k — 1 )! ’ 
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Since there are (k — 1)! orderings of C 2 ,..., Ck, we have 




1 

( 2 k — 1 )!!' 


□ 


Next we will prove an extension of Theorem 3.2 which says that C n is monolithic whp as n —» oo. We 
call m a switching point if U m is monolithic and U m +1 is not. The next lemma shows that no switching 
point occurs after some point whp. 


Lemma 6.2. For any oj —>• oo, whp, there is no switching point greater than uo . 


Proof. Let be the indicator random variable that takes the value 1 if m is a switching point, that is, 


fm 


1 , if m is a switching point 
0 , otherwise. 


We want to find an upper bound on the expected value of £ m . Let A(m,k ) denote the set of monolithic 
chord diagrams with a total of m chords with k of them being simple. Let A(m) denote the union of 
A(m, k) for k > 0. We have 


E&n] = P(£m = 1) 


1 

(2m - 1)!! 


= J2 nUm = C)-F(U m+ i#A(m + l)\U m 

CeA(m) 

|_2m/3j 

E E ^{Um +1 ^ A(m + \)\U m — C ). 

k =0 C£A(m,k) 


c ) 


( 6 . 1 ) 


Note that the upper bound for k in the outer sum above is |_2m/3j since the number of simple chords 
in C cannot exceed 2m/3 provided that C E A(m). To see this, let C € A(m , k). The non-trivial component 
of C has m — k chords and the endpoints of these chords determine 2 m — 2k intervals. Since each such 
interval can contain at most one simple chord, we have k < 2m — 2k and consequently k < 2m/3. 

Now suppose U m = C , where C E A(m,k) and C. In this case, U m+ \ A{m + 1) if and only if the 
(m + l)st chord lies in one of the ( 2 m — 2 k) intervals determined by the non-trivial component and the 
same interval contains a simple chord. For each simple chord, there are 6 possible ways to add a new chord 
that causes U m +i not belong to A{m + 1). (This is the number of ways to choose two endpoints from three 
arcs determined by the two endpoints of the existing simple chord.) Thus, 

3A* 

F(U m+1 0 A(m + 1 )\U m = C)) = 7 ^ T v < -2 ■ 

Next we want to bound the number of summands of the inner sum in (6.1), namely the size A(m, k). 
Each C E A(m, k ) contains a connected component of size m — k, which contains the endpoint 1. Let f(n) 
denote the number of connected chord diagrams with n chords. Now the number of terms in the inner 
sum in (6.1) is /(m — k)( 2 m ~jf 2k ), since we can locate the simple chords in ( 2m ~ 2fc ) wa ys. Hence, using the 
trivial upper bound ( 2 m — 2 k — 1 )!! for /(m — k), 


E[£m] < 


1 

(2m- 1)!! 


|2m/3j 

( 2 m — 2 k 

k= 1 


1 )!! 


( 2 m — 2 k\ 

V k J 


3k 

m 2 


( 6 . 2 ) 


21 







Lastly, we need to bound the sum in (6.2). Let be the term corresponding to index k. Taking the 
quotient of two consecutive terms gives 


efc (2m — 2 k)(2m — 2 k — l ) 2 

ek~ i-i ' (2m — 3k){2m — 3 k — 1)(2 m — 3k — 2) — 

Using (6.3) we find {k — 1)< e\. On the other hand, 

e\ = (2m — 3)!! ( ) —^ < (2m - 1)!!^. 

\ 1 J m- m- 

Using equations (6.2)-(6.4), we get E[£ m ] < 3e/m 2 . Summing over all m, we obtain 

OO OO q 

m=u) m=uj 

which finishes the proof. 

Corollary 6.3. Let m —>• oo. Whp, all the diagrams in {U n : n > m} are jointly monolithic. 
Proof. This follows immediately from Theorem 3.2 and Lemma 6.2. 


(6.3) 


(6.4) 


□ 


□ 


6.2 Discrete model 


Now we present another model for an evolution of a random chord diagram. As in the previous section, 
we add the chords one at a time. However, in this case, we start with 2 n given points on a circle, labeled 
1 through 2n clockwise in increasing order. These points constitute a universal set for the endpoint set 
of a chord diagram obtained during the process. At the first step we create the first chord by choosing a 
partner uniformly at random for the endpoint 1. Then, at each later step we add a chord by choosing two 
of the yet unused endpoints and pairing them up. Thus, at step k, we add a chord by joining two of the 
2n — 2k + 2 endpoints, which do not belong to any of the first k — l chords. We denote by C' k the random 
chord diagram obtained after the fcth chord is added in this process. Now we define a relabeling operation 
for chord diagrams. 


Definition 6.4. (r operator ) Let C be a chord diagram with k chords whose endpoint set {ii,..., * 2 fc} 
is a subset of [ 2 n], and let i\ < Z 2 • • ■ < i 2 k■ We denote by r(C) the chord diagram obtained from C by 
relabeling i t as t for every t e [ 2 k]. 

Example. Let n = 5 and let C be the chord diagram consisting of the chords (2, 8 ), (3, 5), and (4,9). Then 
t(C ) is the chord diagram consisting of the chords (1,5), (2,4), and (3,6). 

Lemma 6.5. t(C i (.) is uniformly distributed among all the chord diagrams with k chords on the endpoint 
set [ 2 k]. 

Proof. Let C be a chord diagram on [2k] with k chords. Let ci,..., Ck be the chords of C where c\ contains 
the endpoint 1. For any endpoint set I = {1 = ■ ■ ■ ,i 2 k} C [2n], there is a unique chord diagram C 

on / such that t(C') = C. On the other hand, for any C with t(C') = C, 


nc' k =c') 


i 

2 n - 1 


n 


3 = 1 


k-j 

( 2 "U) 
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since the partner of endpoint 1 in C is chosen in the first step with probability l/(2n — 1), and at step 
(j + 1) we can choose one of (k — j) chords out of ( 2re “ 2 ^) chords for j € [A; — 1]. Since there are (^Zi) 
diagrams C such that t(C') = C (one for any choice of the 2k — 1 endpoints), we have 

P(r(C')=C) = 

which gives the desired result. □ 

By applying the r operator to the chord diagrams obtained during the process, we get a sequence of 
chord diagrams, t(C[), ... ,T(C' n ), where r{C ' k ) is uniformly distributed over all chord diagrams with k 
chords on [2 k\. Now, we want to show that this sequence has the same probability distribution with the 
sequence of the first n diagrams obtained in the previous section. We write X = Y when the two random 
variables X and Y have the same distribution. 

Lemma 6.6. (r(C(),..., r(C ' n )) = (U\, ..., U n ), that is, 

U n ) 

Before proving the lemma, we introduce another operation in chord diagrams. 

Definition 6.7. (ip operator) Let C be a chord diagram with n chords on the endpoint set [2n]. Let 
r : {chords of C} —>• [n] 

be a bijection such that r{c\) = 1, where c\ denotes the chord containing the endpoint 1. Hence, r defines 
a labeling of the chords such that the chord r -1 (/c) is labeled with k. Let Cj be the subdiagram consisting 
of the chords {r _1 (l),..., r _1 (j)}, the set of chords of C labeled with {1,..., j} under r. Then we define 

<Pnj( c ) = T ( C j)- ( 6 - 5 ) 

We supress the subscript n and the coloring r when there is no danger of confusion and write <Pj(C) instead 
of p r n j(C). In words, <pj deletes the last n — j chords and relabels the endpoints of the remaining chords 
with the labels in [2 j], respecting their previous relative ordering. 

Example. Let n = 7. Let C be the chord diagram on the endpoint set [14] with the chords (1, 5), (2,14), 
(3,8), (4,9), (7,12), (6,13), and (10,11). Let r be the labeling of the chords with the labels 1 through 
7, in this given order. Then, ip r 7 4 (C) is a chord diagram of size 4, and its chords are (1,5), (2,8), (3,6), 
and (4, 7). 

Proof of Lemma 6.6. Let D\,... ,D n be given chord diagrams such that Di is a chord diagram with i 
chords on [2 i\. We want to show that 

P(r(C() = £>!,..., r(C' n ) = D n ) = ¥(Jh = D u ..., U n = D n ) (6.6) 

For brevity, we write D for (D \,..., D n ), r(C') for ( t{C [),..., r(C'( l )), and U for (U \,..., XJ n ). In both 
processes, {Uk} k=1 and {C k } k=1 , we label the chord created at the jth step with j. We denote by r c the 
chord labeling of C' n obtained during the process {C'(.}^' =1 , and by r u the chord labeling of U n obtained 
during the process {Uk} k= i- By definition, we have (J = r(CZ) and p^jiUn) = Uj for all j € [n]. 


2 n - 1\ 1 

2 k - 1 J ’ 2n — 1 


fc-i 

n 


k- j 


= 1 PV 5 ) ( 2k ~ l W 
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Let L = L(D) be the set of labelings of the chords of D n by [n], such that the chord containing endpoint 
1 is labeled with 1 and c Pj{D n ) = Dj for all j E [n — 1]. Note that, if L = 0, then there cannot be any 
evolution in either model that produces D. Consequently, both sides of Equation (6.6) are 0 in that case. 

Claim : The sequence t(C / ) is the same as D if and only if C' n = D n and r c € L. Similarly, the sequence 
U is the same as D if and only if U n = D n and r u eL, 

Proof of the claim: We prove only the first part of the claim since the second part is very similar. 
Suppose first that t(C') = D. In this case, the last components of the two vectors must be the same and 
hence C' n = D n . Consequently, 

Pn,j( D n) = Pnj(C' n ) = t(Cj) = Dj for all j € [n], 
from which it follows that r c € L. Conversely, if D n = C' n and r c € L, then obviously 


D j = ^ j (D n ) = ^ j (C' n ) = T(C' j ), 




which finishes the proof. □ 

Note that a necessary condition for two diagrams having the same chord labeling is that the diagrams 
are the same. Thus, the above claim could indeed be stated as 

(i) t(C') = D r c E L, and 

(ii) U = D r u E L. 

Now, in order to find the probabilities on either side of Equation ( 6 . 6 ), we need to find the probability that 
a given r E L is produced during the corresponding process, i.e., the probabilities P(r c = r) and P(r u = r ). 
Let r be a labeling in L. The two labelings r c and r are the same if and only if the chord r~ l {i) is created 
at the zth step of the process {C' k }^ =1 for all i E [n]. Therefore, we need to find the probability that the 
chord created at step i is the same as ?’ - 1 (i) conditioned on the previous i — 1 chords. Now, the chord 
labeled 1 is created at the first step with probability l/( 2 n — 1 ), and conditioned on the first i — 1 chords, 
the chord labeled i is created at the zth step with probability 2/(2n — 2 i + 2)(2 n — 2i + 1) for i > 1. Thus, 


P(r c = r) 


1 

( 2 n - 1 ) 


n 



2 

2 i + 2 )( 2 n — 2 i + 1 ) 


2 i-i 

{2n- 1 )!’ 


and consequently 


P(r(C') = D) = \L 


1 

(2 n - 1) 


n 


n 


(2 n 


2 

2 i + 2 )( 2 n — 2 i + 1 ) 


\L\ ■ 2 n ~ x 
( 2 n — 1 )! ’ 


We argue similarly for the right side of Equation ( 6 . 6 ). More specifically, for the equality Uj = Dj for 
all j E [n] to hold, we must have r u = r for some r E L. In D n , or in U n , we do not have to keep a record 
of the endpoint labels other than the endpoint labeled 1. As long as we know the endpoint labeled 1 and 
the positions of the endpoints of the chords relative to each other, we can uniquely determine the endpoint 
labeling for D n . Now, given a labeling r in L, we need to find the probability that r is obtained during 
the process {C/fc}^ =1 . For the first chord there is no restriction, so it will be created with probability 1. 
The chord labeled 2 in r is created with probability 1/3 at the second step of the process {14} since there 
are three possibilities after the first chord is drawn. In general, if k chords are already created in {L4 }]? =1 
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matching the labeling r, the chord labeled as k + 1 will be created with probability 
step. Thus, r is obtained with probability 




at the ( k + l)st 


S (T) f 2 "- 1 ) 1 ’ 

and consequently, 

Thus, P(t(C') = D) = P(U = D) as desired. □ 


We extend the definitions of simple chords and monolithicity to chord diagrams obtained during the 
second process {C' k \'f. =l . We say that C' k is monolithic if r(C ' k ) is monolithic. Combining Corollary 6.3 
with Lemma 6.6, we obtain the following generalization of the Flajolet-Noy result (see Theorem 3.2). 

Corollary 6.8. Let u(n) be a function of n that tends to infinity as n tends to infinity. Whp, all the 
chord diagrams C' m for oj(n) < m < n are monolithic as n —>• oo. □ 


7 Concluding remarks 


Let C be a chord diagram and Gc be the corresponding intersection graph. We define the cliques and 
independent sets of C as the preimages of the cliques and independent sets in Gc, respectively. Hence, a 
clique in Gc corresponds to a set of pairwise intersecting chords in C. Likewise, an independent set in Gc 
corresponds to a set of pairwise non-intersecting chords. The clique number of C is the largest k for which 
there is a A;-clique in C. Similarly, the independence number of C is the largest k such that there is an 
independent set of size k in C. 

Recall from Section 1 that Chen et al. [12] and Baik and Jenkins [4] defined an r-crossing and an 
r-nesting as follows. A set of r chords (aq, yf ),..., (x r ,y r ) is an r-crossing if aq < ■ ■ ■ < x r < y\ < ■ ■ ■ < y r 
and it is an r-nesting if aq < ■ ■ ■ < x r < y r < ■ ■ ■ < y\. With this definition, an r-crossing is the same as 
an r-clique. On the other hand, an r-nesting is an independent set of size r but the converse is not true, 
that is, not every independent set in C is a nesting. 

Now let u}(C n ) and a(C n ) denote the clique number and the independence number of C n , respectively. 
Also, let a'(Cn) denote the nesting number of C n , the largest number r for which there is an r-nesting. 
Using a result of Baik and Rains [5, Theorem 3.1], Chen et al. [12, Remark 5.6] remarked that 


lim P 

71—»• OO 


u(C n ) - V2n, 
(2n) 1 / 6 


< 2x 


F(x), 


(7.1) 


where F(x) is the GOE Tracy-Widom distribution function. Moreover, they proved that a'(C n ) and iu(C n ) 
are equally distributed, so (7.1) holds if we replace co(C n ) by a'(C n ) in the equation. On the other hand, 
we have a(C n ) > a'(C n ) since each nesting is an independent set. 

Now define X r as the number of independent sets of size r in C n . We have 


E[A r ] 


( 2 2 (?)(2n-2 r-l)!! _ _j^_ /2r\ 
(2 n — 1)!! r + 1 \ r ) 
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since for each set of 2 r points there are ( 2 r r )/(r+1) ways to pair them without any intersection (this number 
is the rth Catalan number). For r = e\/2n, simplifying this expression and using Stirling’s formula, we get 


E[X r 


2irr 2 


2e^n 


0 —r 2 /2n 


This equation, together with (7.1) and the fact that a{C n ) > a'(C n ), show that, whp, 
y/2 n — tn 1//6 < a(C n ) < eV^n 

for any t —>• oo. We conclude this work with two open questions. 

Question 1. Is there a constant j3 such that a{C n )/y/n —>• f3 in probability? 

Question 2. What is the asymptotic distribution of a(C n ) ? 
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